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ABSTRACT 


Random polygons were used as stimuli in a two-choice 
multiple discrimination learning paradigm designed to test 
individual learning ability. Information processing rate 
(IPR) was used as the measure of learning ability. Variables 
in the test design were: racial group (white, nonwhite), 
pacing mode (self-paced, machine-paced), and stimulus simi- 
larity. Subjects were 121 white and 39 nonwhite male Navy 
recruits. Over 10 trials, a learning effect was demonstrated, 
with internal (split-half) test reliability of .84 overall. 
White performance was superior to nonwhite only in the 
machine-paced mode. Significant correlation between IPR and 
Navy General Classification Test (GCT) scores was seen for 
the entire group, but was present only in the white subgroup 
when the sample was divided by race. Stimulus similarity did 
not prove to be a significant factor. It was concluded that 
a reliable, culture~-free test of general learning ability was 
practicable, although its validity with respect to on-job 


performance has yet to be established. 
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TOL EMENT OF THE PROBLEM 


Personnel selection and classification have long been 
areas of vital interest to the military. The accelerating 
advance of weapons technology in this century has vastly 
increased the personnel requirements of all branches of the 
armed forces. Such advance, when coupled with recent reduc- 
tions in force and manpower levels, have served to place yet 
more emphasis on selection and training. 

Bmtry of the United States into World War I, with its 
concomitant massive personnel classification and training 
requirements, saw the development of and introduction of the 
First group intelligence tests designed for military use--the 
Army Alpha and Beta Tests. Nearly two million men were given 
these tests in the course cf the war, and these results pro- 
vided much of the data base for studies of ethnic, racial, 
and other cultural differences in intelligence and ability in 
subsequent years (Matarazzo, 1972). New demands for skilled 
manpower brought about by the outbreak of World War II led to 
the adoption of the Army General Classification Test (AGCT), 
also a group intelligence test designed specifically for the 
Peet tary. 

Postwar developments and refinements in the field of 
military personnel testing and classification followed the 
general form of the earlier efforts. Emphasis on group 


testing of general intelligence and abilities continued, as 





evidenced by the heavy use of the Armed Forces Qualification 
Test (AFQT) by all services until only very recently. Cur- 
rent policy is typified by the Navy's Basic Test Battery 
(BTB), which seeks to measure not only general intelligence, 
but also arithmetic reasoning ability and aptitude in spe- 
cific areas such as mechanics and clerical work. 

Assignment of Navy recruits to technical school training 
is currently made primarily on the basis of performance on 
certain component parts of the BTB, and failure to attain the 
requisite "cutoff" scores for a given school is basis for 
denial of advanced training in that specialty. The Navy 
maintains an ongoing study of the validity of BTB scores as 
peedictors of school performance (Thomas, 1972a, 1972b). 

Recent emphasis on racial and cultural imbalances in 
group tests of intelligence and abilities, prompted in part 
by Federal legislation designed to eliminate irrelevant bias, 
has led to renewed investigation of all aspects of personnel 
testing and selection. In addition, efforts to upgrade the 
overall quality of Navy personnel in the face of force and 
manpower cutbacks and the loss of the draft have pointed up 
a new approach to the problem with emphasis shifting to human 
development and training rather than selection alone. It can 
be anticipated that the current objective of a smaller, 
better-trained Navy in the near future will only increase the 
demand for adequate prediction of performance in training and 
on the job. Inherent in this demand is the minimization of 


needless losses to the selection program of people who may be 





capable and trainable, but who lack the verbal or cultural 
background necessary to good performance on group- 
administered, paper-and-pencil, intelligence or aptitude 
tests. 

A great deal of controversy and theory surrounds the 
discussion of the nature of human intelligence and innate 
Peolity. Eiforts by Binet early in this century to quantify 
intellectual development levels led to the definition of the 
Intelligence Quotient (IQ) by Wilhelm Stern in 1914. In sub- 
sequent years, a variety of tests of human intelligence has 
emerged. While on the whole valid predictors of academic 
performance, most of these tests rely on an individual's 
level of intellectual development as a basis for determining 
"antelligence” (Matarazzo, 1972). Variance in environmental 
@eecultural opportunity within the United States renders such 
measurement of intelligence vulnerable to the criticism of 
racial or cultural bias. Placement of emphasis on acquired 
knowledge in measuring intellect will invariably result in 
continued questioning of the validity of those measurements 
when applied to disadvantaged segments of the population. 

Resolution of this problem is complicated by the inter- 
action between inherent ability and environmental opportunity 
in determining an individual's intellectual development. 
While inherited or innate ability sets limits on this 
development, exposure to environmental factors which foster 
growth determines to a large degree the level actually 


MeEaArned. scattel | *shtlyo3) derinition of fluid and 





crystallized intelligence, Hebb's (1972) treatment of intel- 
ligence A and intelligence B, and Jensen's (1968) definition 
of Level I and Level II intelligence exemplify recent 
attempts to explore the original ideas of inherited and 
acquired intelligence. 

The impact of cultural differences on general intelli- 
gence tests is easily perceived, if not measured. "Achieve- 
ment" testing of intelligence is seen as susceptible in many 
ways (and varying degrees) to these differences, and the 
resultant bias in scoring can lead to over- or under- 
Miediction of performance or aptitude for minority groups 
/Migomas, 1972c). The existence of cultural or racial bias in 
the Navy's BTB has been identified by "in house" study 
(Stephan, 1973; Thomas, 1972c). Nonetheless, the BTB is 
maintained as the primary enlisted personnel classification 
tool, largely due to demonstrated high validity in predicting 
@oaemmacal school grades (Thomas, 1972a, 1972b). Current 
school assignment policies within the Navy show, however, 
imencasing concern with utilization of qualified minority 
personnel. This concern is echoed by recent emphasis on 
Henmecaty recruiting throughout the nation. This avowed 
objective of attracting and training qualified minority group 
people places still heavier demands on the selection and 
classification processes in the Navy to be both valid and 
unbiased. 

The key to successful utilization of personnel within the 


highly technical environment of today's Mica elec oem 
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training. The worth of an individual to the Navy can be 
directly tied to his or her ability to acquire the knowledge 
and skills of a given job specialty or rating. More gener- 
ally, this can be interpreted as the ability to learn. The 
sole purpose behind development and administration of the BTB 
is in predicting performance in a training environment in the 
hope that this performance will relate to actual job perform- 
ance in the Fleet. While a great deal of data has been col- 
lected on school performance and initial screening scores on 
selection tests (Thomas, 1972a, 1972b), measurement of actual 
job performance in the Navy has proved extremely difficult. 
In addition, those job performance measurements which have 
been made to date have relied upon supervisory rating of per- 
formance, a methodology which has shown very little validity 
in other studies by other services (Fox, et al., 1969). 

Basic Test Battery scores are indicative however, not of 
aeertcuce for training so much as acquired knowledge or 
experience. In addition, emphasis is on verbal or academic 
Material such as found in the school environment. The impact 
of -this emphasis is not readily apparent until viewed in 
light of the heavy reliance on on-the-job training (OJT) in 
the Navy. 

Completion of advanced technical schooling is, of course, 
a prerequisite for successful performance of complex technical 
tasks in the Navy, but almost without exception, extensive OJT 
is necessary before an individual can perform his or her task 


effectively. in the Fleet. The major portion of this training 
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is conducted under actual operational conditions, with 
emphasis on learning-while-doing and observation of skilled 
technicians at work. Accordingly, a selection device must 
accurately predict ability to learn in this environment as 
well as in the school situation. 

In summary, then, it would appear that critical issues 
in the Navy's selection program center about three key areas: 
(1) Response to the characteristics and com- 

position of the current and projected 
recruit manpower pools, 

(2) Development of selection devices or 
concepts which reflect the native 
ability of the individual, and 

(3) Emphasis on learning and performance 
in an operational environment rather 
than in the "schoolhouse" alone. 

It would seem that some general indicator of an indivi- 
dual's ability to learn would be of benefit to the Navy's 
Seamen Lor valid predictors of job performance. Indeed, a 
recent recruiting document lists "good ability to learn" or 
"above average learning ability" as requirements for success 
in Navy technical specialties (USN, 1973). Where success is 
dependent upon training, the ability to respond to this 
training--in nonverbal as well as verbal areas--stands as an 
essential attribute to be measured. 

The primary objective of this study was to design, con- 


struct, and evaluate an objective test of individual learning 
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ability using nonverbal instruments. The guiding concepts 
for the test dictated that it be relatively easy to admin- 
ister, as culture-free as possible, and as far removed as 
possible from current paper-and-pencil "aptitude" intelli- 
gence tests now in use. The resultant test was concep- 
tualized as a nonverbal supplement to the current test 
battery, not as a replacement for established tests for 


academic aptitude. 
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Peele Oe VE LOE MnN aT 


A. DISCRIMINATION LEARNING AS THE TEST TASK 

Discrimination Learning (DL) tasks have for many years 
been used as fundamental tests of intellectual development 
levels. While rooted in animal behavior study, DL has been 
used in countless studies of human learning processes. 
Extensive use of DL in the fields of developmental and 
abnormal psychology has provided a basis for its application 
in studies of adult human learning processes as well. 

The relative simplicity of the majority of existing DL 
tests and techniques (owing largely to the design of such 
tests for animals, children, or retardates) renders them 
inapplicable to the measuring of adult human learning ability 
(Green and O'Connell, 1969). Nonetheless, the basic nature 
of a DL test--that of a performance test that relies upon the 
femepey to learn to distinguish one item from another-- 
justifies investigation into its possible applications to 
testing human intellect. 

Tests of job-related skills, while necessarily perform- 
ance tests in themselves, as a rule provide only a narrow 
view of a testee's aptitude. Little or no attempt is made 
to measure actual intelligence or learning ability. Rather, 
these tests tend to be oriented toward measures of physical 
Or perceptual motor skills peculiar to a certain task or 


mbtcncde While the narrow field Of Concentration Of such tests 
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improves their validity within that field, few are generally 
applicable to a wide range of skills or training programs. 

Introduction of DL test technigues into this area of 
personnel selection would provide measures of learning ability 
heretofore unavailable in reliable form. While tests in use 
meday Can provide accurate indication of an individual's 
tactile sensitivity, kinesthetic sense, dexterity, reaction 
speed, etc., no absolute measure of. learning ability is found 
that is based on actual performance testing. 

Recent investigation of verbal DL by Grey (1971), Baltutis 
Mae?) , Arima and Grey (1972a and 1972b), Bugarin (1973), and 
Arima (1974) provided a great deal of insight into the dynam- 
ics of serial DL of sets of verbal stimuli by adults. 

These studies employed the premises of information theory 
as it relates to information presentation rate and information 
content. The use of verbal material as stimuli in these 
studies severely restricts their applicability to a test of 
general learning ability. The highly cultural and cognitive 
aspects of verbal materials render them virtually unusable in 
cases where subjects are drawn from a culturally diverse 
Beeetation. 

Further, strong scientific evidence of physiological 
Specialization within the human brain suggests that verbal 
material is not processed in the same fashion nor in the same 
brain areas aS nonverbal material (Ornstein, 1972). Indeed, 
even the memory process may display this same specialization, 


reserving one brain center for the retention of visual 
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(pictorial, scenic) information, and another for processing 
Menguistic Or verbal material (Haber, 1970). Tests involving 
verbal stimuli and/or processing, including many paper-and- 
pencil tests currently in use, simply do not reach a great 
proportion of an individual's abilities. Others, such as 
pattern analysis tests, may only touch on these areas. Yet 
many of these abilities are vital to the effective performance 
of tasks in the Navy. 

Peeceturn to nonverbal stimuli and responses, as employed 
in a great amount of DL experimentation, is thus seen as 
imperative if the ideal of a truly culture-free test of native 
learning ability is to be maintained. The problem is then 
perceived as that of developing a nonverbal replica of the 
models used in earlier studies of verbal discrimination learn- 
ing. In this framework, DL tasks become more complex in that 
multiple discriminations must be learned concurrently, making 
the total task more applicable to the measurement of human 


meamming ability. 


Jey fool CONSTRUCTION 

Restriction of a nonverbal DL test suitable for adminis- 
tration to a culturally diverse population of human adults 
begins with the investigation of the information content 
involved. Verbal DL studies demonstrated the importance of 
the information presentation rate in the learning process 


feelbeucis, 1972; Bugarin, 1973) . 
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1. Information Measurement Considerations 

Ouanmertication Of the information presentation rate 
is possible in the case of DL in that stimuli are presented 
Mamaiscrete categories. Application of information theory 
to a set of discrete choices between discrete stimuli provides 
an absolute measure of the amount of information contained in 
each choice. When the initial probability of choice for each 
alternative in a DL stimulus set is known, information theory 
permits measurement of the reduction of response uncertainty 
over repeated exposures to a given stimulus set. In the case 
where each alternative in a stimulus set is equally likely to 


be selected, the information content of that set is given by: 


I = log, (N) 

where 
I=" intormat One cOontcener ne bulrs 
N= mumbernrctocalcornaeives 


(choices) in a stimulus set 
Thus a stimulus set containing two equally likely choices 
(items) contains one bit of information, a four-choice set two 
bats, and so forth. Stating the concept in other words, a 
subject can be said to process one bit of information when he 
selects an alternative from a set of two equally likely sti- 
muli presented simultaneously. The mental process implied in 
this activity is the reduction of uncertainty involved in 
choosing the correct stimulus for response. 
2. The Verbal DL Model 

Gray (1971) directly related learning speed with the 

rate of presentation of stimulus information, and the subse- 


eeme work by Baltutis (1972) and Bugarin (1973) further 
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confirmed the relationship between IPR and learning perform- 
ance in a verbal DL task situation. Of greater importance to 
this study, however, is the implication that the verbal DL 
model can be applied to the measurement of general learning 
eel cy . 

Construction of a visual DL test along the lines of a 
verbal DL test necessarily centered about the location and 
selection of suitable stimuli. The nature of the basic test 
model required a fairly large number of distinguishable sti- 
muli that were as free of cultural influence or implications 
as possible. 

3. Stimulus Materials 

The basic discrimination requirement for the test was 
eae -rmined to be that of shape or pattern discrimination. 
Avoidance of physiological complications, such as color blind- 
ness, further restricted the nature of the stimuli by elimi- 
meme Size and color as discrimination factors. For these 
reasons, two-dimensional, black-and-white patterns of uniform 
size were investigated. 

The need for a relatively long list of dist dageenaone 
shapes would eliminate the use of baSic geometric shapes as 
used in many other visual DL experiments. The desire to avoid 
culturally-oriented stimuli would also eliminate employment of 
so-called "familiar objects." 

Patts. ce al. (1956) ianvestigqated. the IMpis caries 
and construction of metric histoforms. This work was paral- 


leled by that of Attneave and Arnoult (1956) in that both 
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teams were concerned with the generation and informational 
cepects Of random two-dimensional figures. It was felt that 
this area provided the greatest promise of suitable stimuli. 
Preliminary research into these forms has led to the selection 
of a set of 30, two-dimensional, metric polygons generated by 
the method of Attneave and Arnoult (1956) and listed ina 
Study by Arnoult (1956). These items are presented in Figure 
Le 

Evaluation of the information content, subjective 
Similarity, and other attributes of the polygons was deemed 
necessary prior to actual construction of stimulus lists to be 
used in testing. Of major concern were the possible effects 
of intra- and inter-item similarity among figures, resemblance 
to familiar objects (association value), relative complexity, 
etc., as well as possible unforeseen preferences on the part 
of any subject for a given polygon over another in a forced- 
choice situation. This concern was generated by the depend- 
ence of the information content of a choice situation on the 
Probability of selection of one item over another. 

In order to gain some insight into as many of these 
factors as possible, an initial experiment was conducted. The 
purpose of this preliminary study was to discover any signifi- 
Same ctencency on the part of a group of subjects to choose one 
Sermulus item over another upon initial naive exposure to a 
Maar of polygons. In addition, some measure of the degree of 
subjective similarity between items presented in pairs was 


sought. 
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FIGURE 1. Shapes selected for use in assembling 
stimulus lists. 


(Feom Arnowlt, 1956) 
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fa Experiment 1 

The 30 stimulus polygons were arranged in pairs. All 
possible pairs were constructed under the constraint that an 
item would not be paired with itself. Left-right order within 
a given pair was not considered. This resulted in the assem- 
bly of (30 X 29)/2 = 435 different pairings. These pairs were 
then arranged in three columns on sheets. Three separate 
booklets, each containing 145 pairs, were constructed and dis- 
Pemeuted to 60 graduate students at the Naval Postgraduate 
School. Each subject received a single booklet selected at 
random from the three, and was asked to perform two separate 
tasks--selection of one item from each pair and rating of the 
degree of similarity seen between the items of each pair. Sub- 
jects were told that one item in each pair had been arbitrarily 
designated as "correct," i.e., the desired response, and were 
asked to designate that item which they thought to be the 
"correct" response. This selection was to be made with the 
knowledge that designation of the "correct" response was made 
Gemovetely arbitrarily. 

Subjects were cautioned to make their choices solely 
on the basis of a given pair alone, and without regard to pre- 
vious selections. This exercise was intended to simulate as 
closely as possible the condition of facing a stimulus pair in 
a forced-choice situation with no prior knowledge of the cor- 
meet item in the pair. 

Subjects then went through the list a second time, 


rating each pair as to whether the two items in each appeared 


21 





to be very Similar, slightly Similar, or dissimilar. Each 
Paeeewas then assigned a similarity factor of one, two, or 
three, respectively. Full instructions for both tasks, as 
printed on the booklet covers, are presented in Appendix A. 

The choice preferences of the 60 subjects (20 for 
each set of 145 pairs) were translated into percentages and 
cast into a matrix (Table 1). In addition, averages of 
Similarity ratings given for each pair were computed and cast 
into the same matrix format (Table 2). Thus pairwise esti- 
mates of choice preference and item similarity were obtained 
and placed in usable form. 

eee Construction of Test Stimulus Lists 

A subgroup of pairs was selected from the original 
435 that had been rated. These pairs were singled out on the 
basis of choice preference. Subjects making choices within 
these pairs had displayed no significant preference, on the 
average, for either item in each pair (selections were distri- 
buted either 50-50% or 45%-55% between each). This subgroup 
was then used to construct stimulus lists for DL testing. 
Since no marked preference for a given item in a pair had 
been demonstrated, it was felt that the choice probabilities 
associated with each could be considered to be "equally 
likely" for the purposes of evaluating the information con- 
tent of the choice associated with each pair. 

Three stimulus lists of six pairs each were constructed 
from the "equally likely" subgroup of pairs. These lists were 


assembled under the following constraints: 
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List I: Figures in each pair were as dis- 
Similar as possible. In addition, all 

figures in the entire list were as dis- 

Similar as possible. (Within-pair 

Similarity factors--from Table 2--were at 

least 2.50, averaging 2.60, while between- 

pair factors were no less than 1.75, 

averaging 1.98.) pileor 
List II: Figures in each pair were as 

Similar as possible, but dissimilarity 

between pairs was maintained. (Within- 

pair rating factor was no greater than 

1.95, averaging 1.58; the between-pair 

factors were no less than 1.90, averaging 
DeeAO.) 

List III: Figures were as similar as possible, 
both within each pair and between other figures 
taethe list. (Within® pair -siamalarrty factor 
was no more than 1.90, averaging 1.73; between- 
pair factor was no greater than 2.30, averaging 


io 2.) 


These lists are presented in Figures 2, 3, and 4, respectively. 


As can be seen, the lists were constructed in order to 


present discrimination tasks of increasing difficulty. Sti- 


mulus items in List I were chosen to be as distinguishable 


as possible, minimizing intra- and interpair confusion. Simi- 


larity within pairs was added in List II, but each pair was 
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FIGURE 3. Stimulus List II. 


(Maximum Similarity within pairs; minimum similarity 
between pairs.) 
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kept as distinguishable as possible from other pairs in the 
list. Similarity was extended to cover all items in List 
rit. 

When lists of six pairs each had been completed, 
test stimulus lists of 60 pairs were assembled. Each test 
list consisted of 10 repetitions of each of the six pairs 
@amests L, Il, and III. Construction of the 60-pair lists 
was performed on a pseudo-ramdom basis with the following 
mesa CctLOns: 

(a) Lists were subdivided into 10 replicates, 

each of which contained the basic list of 
Six pairs. Order within these replicates 
was pseudo-random in order to give the 
appearance of overall randomness but still 
maintain discrete groupings of stimuli. 

(b) Left-right order within the pairs was 

varied in a pseudo-random fashion as well, 
but was such that a given item was seen on 
the right five times and on the left five 

times in order to preclude positional cues. 

(c) At least one different pair was presented 

before a given pair was repeated. 

(a) Polygons were not rotated or reversed, but 

were presented "upright" at all times 
(Ammoult, 1954). 


The resultant sets of 60 pairs are listed in Table 
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Table 3 


Stimulus Set Ordering for all Three Stimulus Lists 


Replicate 
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10 
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Note. 


Item designations refer to numbers assigned 


tomsctamulus pairs in Figures 2, 3, and 4. 
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Thus each test subject could be presented a total of 
60 pairs of stimuli. Pairs appeared in no apparent order, 
and the correct response was not always on either the right 
or left side; sugjects were forced to learn the correct 
response in each pair solely on the basis of recognition of 


the items within that pair alone. 


Gee tisot APPARATUS 

Test apparatus was designed to provide maximum flexi- 
peelaty in test Wane tea cions. The apparatus array used in 
administering the test 1s diagramed in Figure 5. Critical 
units of the presentation and response equipment were secured 
in place throughout the course of test administration. Dis- 
tance from the subject (edge of table) to the viewing screen 
was 42.5 inches (107.95 cm); reinforcement lights were 
ieedteed &.5 inches (21.59 cm) in front of the screen. Sti- 
mulus pairs occupied an area on the screen approximately 6 
memes (15.24 cm) high by 9 inches (22.86 cm) wide. 

Stimulus pairs were mounted on 35mm slides, one pair to 
a slide. Since each list was presented a total of 10 times, 
the 60 slides required for each list were placed in a 
carousel. Stimuli were rear projected onto a Kodak shadow- 
box screen using a Kodak Ektographic Carousel slide pro- 
jector, Model B-2. A neutral light-reduction filter (Kodak 
Wratten gelatin filter, no. 96 ND 0.50), rated to reduce 
light transmission by 50 percent, was fixed over the pro- 


jector lens to reduce excessive glare on the screen. 
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A modified Ohr-tronics eight-channel paper-tape reader 
was used to control the reinforcement lights (described below) 
so that only correct responses would receive reinforcement. 
Wiring was accomplished so that the pulse used to advance the 
slide projector to the next stimulus pair also advanced the 
tape reader. Tapes were punched to co-ordinate with the 
ordering of the stimulus list in use. 

The apparatus was designed to permit a machine- or self- 
paced mode of presentation. Stimulus presentation rate in 
the machine-~paced mode was controlled by a Lafayette Model 
5004B timer. The timer was set to provide an actuating pulse 
to both projector and tape reader simultaneously every 4.0 
seconds. The time required for the slide projector to cycle 
from a presented slide to the next slide was found to be 1.0 
sec. Since the projection screen was blank during this cycle 
time, the stimulus pairs were visible for only 3.0 sec before 
the timer initiated the next sequence. Thus an IPR of 1/3 , 
bits per sec was accomplished with 1.0 sec between stimuli. 

Stimulus presentation during the self-paced mode was con- 
trolled by either of two identical buttons located on the 
pmdeenwot the response box. Pressing either of these buttons 
initiated the electrical pulse that advanced the slide pro- 
jector and tape reader. (These buttons were inactivated 
during the machine-paced mode to preclude accidental disrup- 
ifwom of the stimulus presentation rate.) 

Two identical buttons fixed on top of the response box 


were used to designate choices. Correct responses were 
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reinforced by one of a pair of 2.5 watt lights placed ona 
small box directly in front of the viewing screen. Incorrect 
responses received no reinforcement. Responses, regardless 
of reinforcement, were recorded on a two-channel Clevite 
brush recorder, Model Mark 220. The tapes thus obtained 
could be used to confirm observed responses, and in the self- 
paced mode to measure inter-response time and total test time. 

Twenty-eight volt DC current to power the tape reader and 
reinforcement lights was obtained from a Power Designs, Inc., 
Model 3650-S DC Power Supply. 

Simplified schematic representation of the apparatus 
wiring is shown in Figures 6 and 7 for the self-paced and 


machine-paced phases, respectively. 
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FIGURE 6. Block Diagram of Test Equipment in Self-Paced 
Mode. 
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Vee Arn RUMEN TWOs TESTS TRA, 


In order to evaluate the characteristics of the con- 
Structed test under conditions as close to operational as 
possible, and also to investigate the appropriateness of the 
various test parameters (IPR, list length and compositon, 
etc.), it was decided to administer the test to as many sub- 
jects as possible during a five-week period in which they 


would be available. 


Pee 15 T HOD 
if Facilities 
Testing was conducted in November and December, 1973, 
at the Naval Training Center (NTC), San Diego, California. 
All testing was performed in an isolated room at the Personnel 
Testing and Classification Center located on board NTC. Since 
activity was planned for both morning and afternoon periods, 
windows in the Seer room were covered with opaque material 
to reduce anticipated glare from sunlight and to achieve uni- 
Perielignting conditions in the room. 
Zee oub ects 
Subjects tested were 160 male U.S. Navy recruits at 
Mi@es Ages ranged from 17 to 26 years, with the average being 
19 years. Average stated schooling level for the group was 
12th grade (11.78). Schooling level within the nonwhite sub- 
group was slightly higher (12.2 years) than the group average. 


Nonwhite subjects were predominantly Negro, although the 
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sample contained Oriental, Maylay (Filipino), and Mexican- 
American recruits. Subjects were assigned to the various 
moot conditions in order of appearance. 
3. Test Design 

The experiment was conducted in four major phases. 
Forty-four subjects were given the test using self-pacing to 
control the stimulus presentation rate. Stimulus List I was 
used throughout the self-paced phase. The remaining three 
phases were machine-paced to present the stimulus pairs at a 
constant rate of one each 4 secs. A one-second inter-stimulus 
time (cycle time of the projector) thus gave a 1/3 bit-per- 
second IPR. In the three machine-paced phases, 43, 40, and 
Semetlojects were tested using Stimulus Lists I, II, and III, 
meepectively. Tabular representation of this test design is 


SHown in Table 4. 


Table 4 


Test Design 


Test Subjects Stimulus 


Group (White; Nonwhite) Pacing List 
di Ate (3 ii 3) Self I 
2 45° (3.0. 13) Machine I 
3 AQ “(314 -_9) Machine en 
4 33 429-0. 4) Machine TL 
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4. Procedure 

BUbNeCG tS. Were somOUdIG WimtO itae Costing, feocm im licEoups 
of not more than six. The apparatus was displayed, and the 
experimental nature of the testing explained briefly prior 
to issuing the verbal instructions contained in Appendix B. 
Instructions emphasized the nature of the stimuli, what was 
required of the subject in the way of response, and the opera- 
tion of the apparatus itself. Subjects were then given the 
opportunity to ask questions about the test and procedure, 
and to decline participation if they so desired. They were 
then asked to wait outside the room and were brought in for 
testing one by one. The instructions for the test were then 
reviewed with each individual as he was seated at the response 
box prior to commencement of the experiment. 

Stimulus pairs were then presented one by one on the 
viewing screen in the order given in Table 3 for his test 
condition. Each list of six pairs was presented in 10 conse- 
cutive trials with no break between lists. As a subject 
Semeeted the figure in each pair that he thought was correct, 
Memeressed the corresponding (right or left) response button 
in front of him. Correct responses were reinforced by a small 
light in front of the view screen, while incorrect responses 
received no reinforcement. The IPR was determined as 
described above. 

As testing was in progress, the experimenter stood 
behind the subject and recorded his responses on an answer 


sheet. Responses were also recorded electrically on a 





two-channel Brush recorder. Upon completion of the test, the 
subject was cautioned not to discuss anything he had seen or 
done in the test with those who had not yet been tested. 

This request was repeated to the entire group after all had 
been through the test. 

Performances by six of the original 160 subjects were 
discarded. Improper operation of the self-pacing buttons 
that put the tape reader out of phase with the projector was 
cause for rejection of three performances. Another subject 
in the first (self-paced) group was unable to follow instruc- 
tions. Timer malfunction caused two performances in the 
first machine-paced group to be eliminated. 

Seventeen other subjects' performances were not used in 
the data analysis because their BTB scores and/or demographic 
data could not be retrieved from computerized records. Asa 
result of these subject losses, the 137 remaining subjects 
(white and nonwhite) were distributed as follows: Group l 
eget); Group 2 (25, 12); Group 3 (28, 8); and Group 4 


Ore) . 
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ive RESULTS 


4 


Individual performances in the test, in the form of number 
of correct choices made per trial per unit of time, were com- 
puted to arrive at the test measure of effectiveness, Infor- 
mation Processing Rate (IPR). Dimensions of IPR were bits of 
information correctly processed per second. Performances in 
eer st trial were not used, since responses in the initial 
trial were dependent wholly upon chance, and as such were not 
miemeative of learning ability. 

The number correct in each trial was divided by the amount 
of time the stimuli were presented to the subject. (In the 
machine-paced mode, this was a constant 3 sec. per pair. 
Scores for the self-paced group were scaled to individual 
rates.) In both situations, the 1 sec. cycle time (inter- 
stimulus time) of the slide projector was not included in 
Gemeucing IPR. The resultant trial IPR scores were grouped 
into three blocks of three consecutive trials each. These 
figures are listed in Table 5. Rates of processing informa- 
hGH are seen to generally increase over blocks of trials for 
all groups. (The single exception is the nonwhite subset of 
test group Four, where performance declines very slightly over 
eaprhs. This group contained three subjects.) Overall per-— 
formances by all groups were quite similar, despite differ- 
ences in pacing mode and stimulus similarity between groups. 


Overall performance by the nonwhites in test group One 
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(self-paced) exceeded that of the whites; the reverse was true 
Bepetne three machine-paced groups. Figures 8 and 9 depict 
aspects of these situations. 

mie results listed in Table 5 were subjected to an analysis 
of variance using a three-way design compensating for unequal 
Sereemopulations by test group, racial group, and blocks of 
thdals as described by Kirk (1968). This design utilizes the 
harmonic mean to estimate within-cell degrees of freedom. The 
iemermees Of this analysis are presented in Table 6. 

As can be seen in Table 6, Significant effects were noted 
between racial groups and among blocks of trials. Analyses of 
Variance were also conducted uSing a one-way, repeated measures 
design in order to determine the contribution of between- 
subjects variability to the overall error term of Table 6. 
These analyses were run for four groups, established on the 
basis of race and pacing mode, and the results are listed in 
wae 7, Significant pet Cenecubiece and between-blocks 
effects are seen in all groups. 

Table 8 shows the results of an analysis of variance con- 
ducted using a two-way design for unequal cell frequencies 
Ranney 1962) on the four groups established in Table 7. With 
Subjects grouped in this manner, no significant effect is 
femed as a result of pacing mode or racial grouping. Because 
performances by nonwhites were seen to exceed those of whites 
in the self-paced mode and yet lag behind in the machine-paced 
groups, an analysis of variance was conducted using only the 
machine-paced groups. The results of this analysis, setting 


race against stimulus sets, and using the same design to allow 
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Table 6 


Analysis of Variance of Overall Performance by Test 
GEewe, Lacial Group, and Blocks werel:iaiss 


Term dt SS MS FP 
eect ll Ze lee o> JU == — —— 
Test Group (T) 3 SOS Ze. 5.0) 16942 lOs80. 230 nese 
Racial Group (R) 1 31,511200 sly otl200 24.2568 05 
Pepe Block (B) Z 117, 910200) Sco 55. 00 eo 02 se Ou 
IPS a 3 24,396.00 Belg 0 noite nse 
ieee 1B 6 a B10 Fel te) aye (8) eee. 0 OZ sO ierce 
R X B Z 21,346.00 “10 7a 00-1 AS 2 eee 
ie Xx B 6 137 214.200 220230 = 0. 2 9 Oe ioe 
Error 220°) 1 oA G ZO 0EG 0 Toa 7 now == a 
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Table 7 


Analysis of Variance of Overall Performance 
by SubDIJeCE and BlockSeor GTeialc 


Term aft 
Nonwhite, Self-Paced 
Lota | SZ 
Supject 1K 
Block # 2 
Earor 20 
White, Self-Paced 
Total fal 
Subject 23 
Block # 2 
Error 46 
Nonwhite, Mach-Paced 
total 68 
Subject 22 
Block # 2 
BisisOr 44 
Pitee, Mach-Paced 
Total 236 
Subject 78 
Block # 2 
Error i 6 
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Table 8 


Analysis of Variance of Overall Performance 
by Racial Group and Pacing Method 





Term df SS MS E p 
Total 137 465,094.994 — = — 
Racial Grp (R) iL A, ANT .A7 5 © 4, 47 24 7S ee eee 
Pacing Mode (P) 1 BAD eon 2425501) SO Crime 
R XP il 8.772296. 8 7720901 2o0e setae 
beror 134" A550 s662.057 435570 0 ee = — 

Table 9 


Analysis of Variance of Overall Performance 
by Racial Group and Stimulus Set 
(Machine - Paced Only) 


Texm dt SS MS ae: 
I a eee ee eee eee 
Total 102 57 35262925 — —— — 
Racial Grp (R) 1 342.169 342,169 Boece 02e 
Stimulus Set (S) 2 4.9758 2.379 R08 Si es- 
ax S 2 O65 iy AS, 058 “0.95 CRamnwse 
naeirOr 97 4,873.884 50.246 — — 
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for unequal sets, are seen in Table 9. In this case, race is 
seen to be a significant factor, while no apparent difference 
is seen between the performances on each stimulus set, despite 
Eme graded difficulty (similarity) of each list. 

Internal reliability of the test itself was investigated 
using a split-half design for each test group and each racial 
group as well as for overall performances. Processing rates 
were compared for trials 4, 6, and 8 against those of trials 
weer and 9. In addition, scores on the latter group of 
trials were compared with those obtained on trials 6, 8, and 
10. The former comparison will be referred to as "low trials" 
pme@meene latter, as “high trials.” 

Correlation coefficients thus obtained were used in the 
Spearman-Brown formula for split-half correlations. Both the 
raw coefficients and the Spearman-Brown Seeneee seins are 
ieeted in Table 10. A majority of the coefficients are seen 
to be statistically significant. 

The relationship between scores on the experimental test 
promiene traditional methods of measuring Navy recruit poten- 
tial was investigated using the test subjects' scores on the 
Memegecencral Classification Test (GCT), a major portion of 
the standard Basic Test Battery (BTB). The basis for the GCT 
ieee win verbal ability, since the test consists of sentence 
completions and verbal analogies. Test scores are scaled on 
a normalized distribution with a mean of 50 and a standard 
deviation of 10. Performance on the Arithmetic Reasoning Test 


(ARI) is often combined with GCT scores to obtain a rough 
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Table 10 


Split-Half Reliability Coefficients 


Low Trials High Trials 
ps (468 vs 579) (579 vs 6810) Totals 
FOP miiraw) ey Eio—B) (raw) gee (ob) Low High 
nina te ASG .868** 13 .832*% 
.865**% oye 
Nonwhite 756 .861** . 864 ~927** 
White - 800 .889** . 865 ~928%** 
A teh ff dhs ce 202i 
Nonwhite . 700 .824*= . 826 noUSS = 
White 6155 oma 7632 ) 7 ee 
eo woo 
Nonwhite OU eo wt, WoS5 Oo 
White ~674 .805** 664 ~1/98** 
.802** By oye 
Nonwhite Posy 2776 .610 my es: 
Totals 
White OS aes -o4o 
Nonwhite .788** ,Ovo | 


Combined nolan? Ae oo lai Fe spe 1[2) 003 


i eT 


Seiguiticant at p < .05- 


Seesacgniaticant at p < .Ol. 
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"multiple" used in determining Navy technical school eligi- 
Prlaty and aptitude. 

Pearson product-moment correlations were computed between 
Beat scores and GCT scores obtained from individual service 
Files. These correlations were determined for racial sub- 
groups of subjects falling below and above the GCT mean score 
of 50, for both racial groups in toto, and for the entire 
sample. These figures are seen in Table 11. Significant 
weaies OL the correlation coefficient are noted only in the 
white group as a whole and for the entire sample. Nonwhite 
Bece Scores did not correlate significantly with GCT 


performance. 


Table ll 


Correlations of Test Performance (IPR) with Navy 
General Classification Test (GCT) Score 


Group Averages Correlation Coeffierenr 
GCT JiR GCT GRP Race GRP Total 
Low (<50) 42.67 720s Peels 
N=24 
Nonwhite 
a AS 
N=33 
ee Stile ue A (S169) 562289 a207 mod 
N=9 
— per ad Os 2 
Low (<50) 42.18 PAST; s206 
N=17 
N=104 
a High (250) 59.63 .238 050 
N=o7 
Potdniricapteat. <— GUon 


**Significant atp = .0: 
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Additional insight into the relationship between general 
(nonverbal) learning ability and verbal intelligence was 
obtained through the use of linear regression techniques. 
Figures obtained for the full sample, as well as the white 
and nonwhite subgroups are listed in Table 12. A further 
breakdown of the nonwhite subgroup into self-paced and 
machine-paced units is displayed in Table 13. These rela- 
tionships are graphically presented in Figures 10 and 1l. 

It should be noted that the number of scores used in 
these investigations involving GCT scores was one less than 
that used in previous calculations. The GCT score of one 
nonwhite subject could not be obtained from computerized 
records. Therefore, the number of nonwhite subjects was 


meauced from 34 to 33 for these calculations alone. 
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Table 12 


Linear Regression of Test Performance (IPR) Against 
Navy General Classification Test, (CCl = ccore 


IPR (X) (Cen Correlation i= 
Group Mean S.D. Mean S.D. Coefficient Slope Intercept 


White W255 wee 049), (562 76mg s meee 28 47.98 
(N=104) 
Memwhiite .208 .081 46.55 7.87 Males: 2 AD es 
(N=33) 
@emorned .227 .059 54.31 9.30 2 42 Aa Fe 
a= 137) 
Moegniticant at p < .05. 


Geetognificant at p < .Ol. 


Table 13 
Linear Regression of Test Performance (IPR) Against 


Navy General Classification Test (GCT) Score 
by Pacing Mode (Nonwhites Only) 


i SS 





Ses poe oe Correlation Ye 
Pacing Mean S.D. Mean S.D. Coefficient Slope Intercept 
Seut per 258 44526 82 ~154 Jbak 42.04 
ae) 
Meeqme .201 .051 47.64 7.47 -401 50 Se 7 
(N=22) 
Combined .208 .081 46.55 7.87 Bs: ae AZ 
i= 23) 
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FIGURE 10. Linear Regression of Test Performance (IPR) Against 
Navy General Classification Test (GCT) Score. 
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VoeevlsecU sole OF RESULTS 


A. GENERAL 

Initial inspection of the experimental results shows that 
learning did, in fact, take place in the course of the test. 
In all test and racial groups but one the amount of informa- 
tion processed increased as the trials progressed. The 
Significance of this increase was confirmed by analysis of 
variance. The only group ae Gae Nee Gucplay this leaning 
Beecet was Considered £O be too small (N = 3) to provide any 
sort of conclusive evidence. 

The machine-paced mode of information presentation neces- 
sarily limited the maximum possible IPR to that of the pre- 
sentation rate (.333 bits/sec). Comparison of the IPR's 
obtained during the third block of trials shows that whites 
attained a maximum of 80 percent of this "perfect learning" 
rate, while nonwhites reached 69 percent of this quantity. 
Inspection of the IPR's for the three blocks shows that the 
greatest marginal learning occurred during the initial trials 
with the increases on the remaining trials becoming propor- 
tionally smaller. This effect corresponds to the classical 
"learning curve " of grouped data, showing rapid gains early 
in the testing, with learning eee tapering off as trials 
are continued. That this effect was present in this case may 
bewdue, at least in part, to generalization of stimulus cues 
between pairs. This factor is suggested in literature on 


verbal learning tasks (Gibson, 1942, 1959). 
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At the very least, even though some individuals did accom- 
peeestt “perfect learning” well before the last trial, the test 
was not continued until all, or even nearly all, subjects had 
learned the correct response in all six pairs. Thus the time 
to perfect learning is not known from these results. Of 
greater value, however, is the ability to differentiate 
between individual performances over the same test arco.) Such 
differences are, as a rule, more apparent at the intermediate 
stages than toward the "flatter" end of the learning curve. 
Thus, stopping the testing at the tenth trial, while a some- 
what arbitrary decision when made, was a better point than the 
perfect learning point, although it may well not have been the 
Geeimal point at which to terminate. 

Some caution should also be exercised in interpreting the 
experimental data due to restriction of range in the test 
sample's measured abilities. Because the testing was accom- 
plished using men who had already been inducted into the Navy, 
the population from which the sample was drawn represented a 
Puemeel ected group, 80 percent of whom were elugspblestorstech- 
nical school training upon entry. These recruiting standards 
ensure that the majority of recruits accepted into the Navy 
represents at least the 50th percentile of the service-age 


population in general 


B. SELF-PACING vs. MACHINE-PACING 
No significant difference in IPR was noted between the 


self-paced and machine-paced groups. This is notable in light 


Spl 





@merene fact that the self-paced group was under no constraint 
to reach a set or specified rate. These findings tend to sup- 
Weme the findings of Arima and Gray (1972), namely that IPR is 
not affected by presentation rate. An advantage of the self- 
pacing design is that it allows a superior performer to seek 
his own level of accomplishment without being limited by a set 
presentation rate. In the course of testing, the performances 
of two nonwhites and four whites in the initial group exceeded 
the machine-paced groups' presentation rate of 1/3 bit per sec. 
Another effect noted in this case was that nonwhites performed 
on a par with (actually, somewhat better than) the whites in 
the self-paced situation, but did significantly poorer when 
stimuli were presented at a fixed rate. It was in this aspect 
that the test proved to be "culture-fair." 

Self-pacing would appear to be a better choice for this 
type of test, in that the widest range of test scores 1S pos- 
Sible. This feature, coupled with the lack of cultural bias 
noted above, Beate that this would be a useful selection 
tool in that individual differences would be made more 
apparent. It is also possible that examination of test-taking 
Strategies ina self-paced situation might give valuable 
insight into elements of individual personality and motivation. 
In short, a test with a fixed presentation rate can answer the 
guestion, "How much was learned?" A self-paced test can find 
out, "How much was learned, and how fast was it learned?" This 
ability per se renders such a test the more useful of the two 


(Estes, 1974). 
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Disadvantages to self-pacing are seen to be twofold. For 
one, there is no theoretical upper limit to the scores. This 
is not seen as a great problem. The other drawback is seen 
to be the greater variability apparent in the self-paced 
scores in comparison to the machine-paced performances. 
Again, however, this 1S not seen aS a serious problem. In 
fact, it may be indicative of greater differentiation between 


individuals accomplished in the course of testing. 


GeaeeolLiLMULUS LISTS 

Despite the fact that three stimulus lists were constructed 
to give graduated degrees of similarity between items, and thus 
graduated difficulty, performances on the three lists were not 
Significantly different. An implication of this effect is 
that construction of equivalent stimulus lists is made simple. 
Explanation of this phenomenon is not quite so easy. A possi- 
bility is that, since the lists were all two-choice situations, 
the basic informational aspects of the choice itself prevailed, 
namely that the information content of the stimulus lies in the 
number of alternatives presented, rather than the information 
pemeant or relative similarity of the figures themselves. It 
is also possible, however, that the actual discriminations made 
by the subjects during testing were made on a much finer level 
than were the original judgments of similarity. Thus, if each 
figure was seen as distinct from the others at the outset, the 
confusing effect of intra- and interpair similarity was nulli- 


ied |On this it 1S imeemeant to note@ehat the originadthirty 


By) 





figures had been selected for their "discriminability" in the 
pec oy Arnoult (1956).: Thus, even though the three lists 
had been scaled for Similarity, the figures in the lists were 
apparently not so similar as to make ready discrimination 
ieee icult. 

It is interesting to note here that several tests of 
verbal discrimination ability (Arima & Gray, 1972; Baltutis, 
ive Bugarin, 1973) used word (stimulus) lists containing 
migh and low similarity items, but found that this had no 
effect on IPR. All the words used occurred very frequently 
in normal language use, and as such were readily distinguished 


Seep ioentified per se. 


Pee SP LATTONSHIP OF JPR TO MEASURED INTELLIGENCE 

Direct correlation between general learning ability (IPR) 
and measured verbal intelligence (GCT) was seen to be small 
for both racial groups. Internal reliability was shown to be 
strong, and as such was eliminated as a possible reason for 
this low correlation. In addition, sample scores were 
inspected to determine if restriction of range had been an 
Mereiicipatea factor. Table 14 shows, however, that this was 
not the case. Since the aim of this study was to develop a 
test of an area not measured by current, verbal-oriented 
tests, the lack of strong correlation between subjects' per- 
formances on the two tests indicates that the instruments do, 
maaecad, measure different abilities. Validation of the experi- 
mental test as a predictor of on-job performance is a task 


that does not lie within the scope of this study, but which 
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Ranges of Test Performances 


Table 14 


GCT 
Group 


(20) 
(250) 
(<50) 


(>50) 


Low High Range 
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nonetheless is essential to determine the exact nature of the 


test's value in the selection process. 


Pee lle LLEGENCE, TEST PERFORMANCE, AND RACIAL DIFFERENCES 

Analysis of overall performance during the test showed 
that, aside from the "learning effect" discussed above, only 
racial differences proved to be a Significant factor in learn- 
ing performance. This factor was further isolated to the 
machine~paced test groups only, where white performance 
exceeded nonwhite performance; the reverse was true for the 
self-paced group, although the difference in this case was not 
Significant. It would seem that the pacing mode of the test 
itself affected the performance of the nonwhite subjects. 

While no concrete theory is advanced here to explain this 
effect, it 1s possible that the pressure of machine-paced pre- 
sentation made the nonwhite subjects more anxious, and thus 
less able to perform up to their true ability. fThis hypothe- 
Sis is suggested by studies by Taylor and Spence (1952) and 
by Ramond (1953), in which subject anxiety was found to have a 
detrimental effect on performance in serial and verbal learn- 
ing tasks. 

In the light of in-house findings of racial test bias 
Peeenin the Navy BTB (Stephan, 1973; Thomas, 1972c), the 
results of a 2 X 2 Chi-square test of GCT score distribution 
by racial group within the test sample are of interest. When 
the sample was divided into four groups by race (white, non- 
Wiese) and: GCE score (bellow the mean Score of 50, above 50), 


tnemicelis showed 87 whites with GCT scores of 50 or above, 1/7 
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below 50, 9 nonwhites with scores 50 or higher, and 27 non- 
whites below the GCT mean. This distribution yielded a Chi- 
eeeecmotatiotice (dt) = 1) Of 35733, which 1s Significant at 
the .001 level of confidence. The men in the test sample 
were by no means evenly distributed across the GCT score 


range with respect to racial lines. 


PS TESTING CONSIDERATIONS 

Internal reliability of the test itself was established 
uSing the split-half technique and employing the Spearman- 
Brown coefficient. Reliability coefficients were, in aeneeal, 
overwhelmingly strong, especially when the total testing time 
(4 min.) 1S considered. In order to establish the validity 
of the test in predicting on~job training performance, com~ 
prehensive follow-up study is necessary. Most Navy classifi- 
cation tests in use today show strong predictive capability 
for school performance (Thomas, 1972a and 1972b), but are not 
validated against on-job performance. 

While the apparatus assembled for this study was net 
practical for use in an operational setting, the test mate- 
raigule are readily adaptable to existing teaching machines, 
many of which could provide on-the-spot scoring as well. The 
test concept is also highly compatible with computer-based 
teaching systems, such as PLATO. Such a system would permit 
testing of potential recruits at remote terminal sites con- 
nected to a central computer system that would administer, 


score, and evaluate performances in a real-time setting. 
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VI. CONCLUSIONS 


Conclusions that may be drawn from the results of this 
study are as follows: 

1. Learning did, in fact, take place during the course of 
the administration of the experimental test. 

Ze Termination of testing prior to the “perfect learning" 
point is considered to give greater differentiation between 
individual performances, but Ele point chosen (after 10 trials) 
may Or may not be optimal. 

3. The rate at which information was processed by the 
self-paced group was not significantly different from that of 
the machine-paced groups, indicating that IPR is, in many 
cases, independent cf presentation rate. 

4. The self-paced task enables a superior performer to 
attain his own level without being limited by presentation 
1s eSe 

5. No significant difference was seen between white and 
nonwhite performance in the self-paced task, but this was not 
true in the machine-paced situation, where white scores 
ceeded nonwhite. | 

6. Intra- and interpair similarity was not a factor in 
learning performance, suggesting that item similarity using 
these random shapes may not be important in constructing 
equivalent stimulus lists. 

ie ePerrormance On the experimental test did not correlate 
highly with performance on standard verbal-oriented intelli- 


gence tests (Navy GCT). 
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Appendix A 
SUBSECTS FINSERUCTIONS —— EXPERIMENT Ff 


Three columns of pairs of two-dimensional shapes are 
listed on the following pages. In each case, one shape has 
been arbitrarily selected as "correct." Selection of the 
"correct" shape in each pair was made without regard to any 
of the other selections in anes pairs. That is, no syste- 
matic procedure was used. 

it) Go through the entire list, pair by pair, and cixrele 

the shape in each pair which you think is the one 
that. was Selected as "COnrrEGEE. * 
m2) Go through the entire list again, this time writing 
in the space between the members of each pair a 
numer One (1), two (28 Vor tireen(s), as follows: 
"1" - If the two shapes of the pair appear to 
you to be very similar. 
"2" ~- If the two shapes of the pair appear 
GOmyeulGO be vonly Sie meh 9s ainaelcues 
"3" - If the two shapes of the pair appear 
to you to be dissimilar. 
When you have completed these tasks, return this booklet 


fPemevout Instructor. Thank you for your time. 
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Appendix B 
SUBJECTS' INSTRUCTIONS — EXPERIMENT II 


T am asking you to volunteer to take an experimental test. 
T£ you do volunteer, the test itself will take about four 
minutes of your time. This test is completely experimental-- 
moening will ever get into your training jacket here at NTC, 
nor into your service record. You will be helping me to "test 
out the test." 

In a few minutes, I will be showing you a lot of pairs of 
black shapes on a white background. These shapes were drawn 
by a computer, and they are not supposed to look like or mean 
@ieeming in particular. The shapes will appear on this screen 
two by two. One of the shapes in each pair is one that I have 
decided to call the correct answer; the other shape 1S a wrong 
answer. When you first see a pair of the figures, I want you 
to guess which one is the one I called "correct." You will 
give your answers by pressing these two buttons. If you think 
the right-hand shape is "corect," press ‘the right-hand button. 
If you think it's the left one, press the left button. If 
you guessed correctly, a light at the bottom of the screen 
will come on. If you made the wrong guess, no lights will 
light up. You will see each pair of shapes more than once. 
The first time a pair appears, you will be guessing at the 
correct shape, but I want you to try to remember which shape 


is correct, so you can make the right answer without guessing 


66 





the next time you see the pair again. There are a lot of dif- 
ferent pairs, so this is a tough test. Don't worry if you 
don't seem to be getting a lot of right answers; just do your 
best and try to remember which shape is the correct one in 
each pair. Make only one answer each time; if you didn't get 


feecaight, you know it's the other one. 


peeoetAlL INSTRUCTIONS FOR SELF-PACED GROUP: 

When you have made your answer, press one of these buttons 
on the sides of the box to go on to the next pair. Work as 
Pmomeas YOU Can, but don't rush it and don't just go through 


the whole time guessing. 


SeeECriAL INSTRUCTIONS FOR MACHINE-PACED GROUP: 

The pairs will come on the screen every few seconds, so 
you don't have a lot of time to decide which one is correct. 
Me@@eaill have plenty of time €o get a good look at the shapes, 
make your decision, and make your answer. The pairs will be 


eommume Screen the same amount of time each time. 


REMAINING INSTRUCTIONS FOR ENTIRE SAMPLE: 

Ferenber that there are a lot of different pairs, but 
that you will see each pair several times. The correct shape 
in each pair will always be the correct answer, but will some- 
times be on the right-hand side and sometimes on the left. 
It will always appear with the same shape. The shapes will 
not be turned around or flipped over--they will always be the 


Same way as you first saw them. 


67 





Appendix C 


GC io CORES AND ibot  —PenwORMANCE BY “fEST GROUP 


PE St GROUP 1 


JES. 
Subject GCT Bigelk 1 BLOCK Z Bicek. 3 

il 58 388 Sis 410 

2 59 26). econ Se) 

3 48 1s) 176 ey) 

* 4 40 76 tS 2 SY 
5 GZ 216 288 306 

6 57 70 sg (0) 70 
a 41 181° 231 230 
8 58 239 279 259 

2 55 ae 5 Zao 219 
180) 40 eal eZ Se 
SIL IL 42 25 187 200 
*12 46 160 Lent 203 
als! 62 156 7s 13 
*14 40 180 246 197 
1S Gul! Vir 308 308 
16 53 138 sy 26 
ey 38 204 204 238 
a Te! SZ 248 248 Vi page 
*19 47 405 405 405 
20 58 eZ 3 M35 197 
Zale 60 Aa 223 S70 
Ce 70 wg, 159 706 
23 60 is: 97 161 
24 59 56 156 242 
UIA 49 426 503 Syextl 
26 26 208 208 340 
ii al 130 162 260 
28 We a 107 85 
29 63 169 192 205 
310 39 90 sL IL 90 
oh! 58 143 ei, 214 
32 55 seas Zier 306 
28 60 187 187 214 
34 43 182 oS 265 
35 41 180 180 ab) I: 


Note: (1) IPR is presented in bits/sec X 10?. 


(2) Nonwhite subjects designated by *. 
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TEST GROUP 2 


IPR 
Sub vec evens sleyel< AE Balocke.Z Bikotie y 3 

pape oy, 148 240 240 
2 56 240 335 35.5 
cca 44 166 240 296 
x 4 52 166 166 148 
5 Sil ESD 24a 27 

6 65 185 240 Bee 

7 65 205 296 335 
as: 40 IP) DE IES. 20 
g 57 ZO 314 314 
10 45 Gr IS INS) 
alk 53 240 314 296 
PaleZ 69 257 Soh B3e 
is Do IAS S. 256 296 
14 33 35 240 296 
Abs) 46 203 2 296 
16 310) 202 314 So5 
17 sah 277 29 296 
18 54 oi 7: Ne © 122 
as, ou 222 314 270 
20 65 296 296 355 
iD JI 46 129 208 259 
ae 48 ceseal 166 IES 
*23 ai! ae 148 Zoe 
24 oo) L385 206 1x5) © 
25 a5 23 a 222 
*26 42 ee 129 148 
oY) i 45 240 166 203 
28 61 2a2 314 a3 
BLS, 44 Wee LAs Jeb 
30 46 Ce PRI) UI DIF 
Suk 63 185 274 314 
32 60 222 296 296 
ud 63) 40 166 222 333 
34 40 148 92 222 
aS 56 291 336 314 
36 65 129 240 Zid 
oF 66 Lbeeos, 314 29 7 


Nowe: \ (1) SPR cepresecutce tnebresc/sec X10: 


(2) Nonwhite subjects designated by *. 
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Test eGROUP 3 


282 
Subject GCT Block 1 Bleck 2 Bock 3 

i 66 185 148 5 

Z 61 185 185 240 
oS 40 240 240 20s 
4 61 166 Mies 240 
oS) 47 148 240 185 
6 61 iL tsys: 259 314 

7 67 250 EAS} 314 

8 46 202 240 240 
se 42 166 BS 185 
IG, 59 240 259 20 
11 65 203 259 MAY 
eZ 68 185 240 314 
13 58 148 259 277 
14 59 166 259 240 
LS 46 222 240 240 
16 38 Les 240 222 
1, G2 hes 240 185 
18 Ve 148 166 222 
19 Sy, PAOD Pi 166 240 
20 56 240 240 240 
Pe A\ 63 208 240 259 
22 or) 185 218 Bee 
23 ays: 148 240 240 
24 44 129 Ze 296 
e275 59 240 Px) &: 296 
26 62 ies 185 240 
27 75 166 222 166 
28 60 229 314 338 
*29 42 240 166 129 
* 30 49 148 129 sis 
Syl 59 166 314 314 
B72 54 185 259 296 
25 63 Les S25 314 
34 5 240 314 314 
B80 63 166 Tes 296 
56 58 240 296 314 


NeGers (1) sPPRets presented immebtes, seer 0 = 


(2) Nonwhite subjects designated by *. 
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TEST GROUP 4 


ede 
Subject GCT Biceiel Brocic 2 Biocice.3 

ab 61 Zee 314 px ef! 

2 58 166 hes 240 

3 52 129 Ze, ZG) 

4 13 125 233 314 

5 62 SS 240 Ze 

6 65 148 240 ZO 

7 pS, 129 205 Pa OAV 

8 59 ites 259 240 

9 69 296 335 208 
16 by) 166 240 236 
Idk He 240 — 296 Les! 
lee 56 Zoe 314 533 
ilies owe 166 207) 222 
14 50 ime 185 220 
= ES -- 296 259 205 
16 61 259 259 207 
14 64 166 Zo2 2 
18 54 129 240 Zac 
19 63 203 222 eS 
20 56 148 259 256 
21 47 294 240 ZG 
a 50 J dp2 3 lea 
Ze 66 ZU 185 203 
*24 52 259 259 314 
25 2 203 203 277 
26 48 ie Ze 247 
Ze, 52 203 203 Zee 
28 60 222 314 314 
Zo 58, ILS, 166 205 
30 53 Wess 240 296 


Note mij. Ue Rws soresenteccmia bitsy secs wane. 


(2) Nonwhite subjects designated by *. 
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